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Abstract 



Given a discrete time sample X\, . . . Xn from a Levy process X = 
{Xt)t>o of a finite jump activity, we study the problem of nonpara- 
metric estimation of the characteristic triplet (7,cr^,/5) corresponding 
to the process X. Based on Fourier inversion and kernel smoothing, 

CS ' we propose estimators of 7, a^ and p and study their asymptotic be- 

haviour. The obtained results include derivation of upper bounds on 
the mean square error of the estimators of 7 and a'^ and an upper 

^p , bound on the mean integrated square error of an estimator of p. 

. I Keywords: Characteristic triplet; Fourier inversion; kernel smoothing; 

r^ I Levy density; Levy process; mean integrated square error; mean square 

error. 
AMS subject classification: 62G07, 62G20 



1 Introduction 

Levy processes are stochastic processes with stationary independent incre- 
ments. The class of such processes is extremely rich, the best known rep- 
resentatives being Poisson and compound Poisson processes, Brownian mo- 
tion, Cauchy process and, more generally, stable processes. Though the 
basic properties of Levy pro cesses have been well-studied and understood 
since a long time, see e.g. [29|, during the last years there has been a renais- 
sance of interest in Levy processes. This revival of interest is mainly due to 
the fact that Levy processes found numerous applications in practice and 
proved to be useful in a broad range of fields, including finance, insurance, 
queueing, telecommunications, quantum theory, extreme value theory and 



many others, see e.g. |3(] for an overview. 13|] provides a thorough treatment 
of applications of Levy processes in finance. Comprehensive modern texts 
on fundamentals of Levy processes are [y, 1231 . |27I | , and we refer to those for 



precise definitions and more details concerning properties of Levy processes. 
Already from the outset an intimate relation of Levy processes with 
infinitely divisible distributions was discovered. For a detailed exposition of 
infinitely divisible distributions see e.g. [30]. In fact there is a one-to-one 
correspondence between Levy processes and infinitely divisible distributions: 
if X = {Xt)t>Q is a Levy process, then its marginal distributions are all 
infinitely divisible and are determined by the distribution of Xi . Conversely, 
given an infinitely divisible distribution ^u, one can construct a Levy process, 
such that Pxi = /^- The celebrated Levy-Khintchine formula for infinitely 
divisible distributions provides us with an expression for the characteristic 
function of Xi, which can be written as 



4>x^ {z) = exp 



-*- .2 2 
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i'^z - -o z^ + / (e*^^' - 1 - izxl[_i^i](x))z^((ix) 



fll 



where 7 € M, o" > and z/ is a measure concentrated on M\{0}, such that 
Jjj(l A x^)v(dx) < cxD. This measure is called the Levy measure correspond- 
ing to the Levy process X, while the triple (7,(7^,1') is referred to as the 
characteristic or Levy triplet of X. The representation in ([1]) in terms of the 
triplet (7, (T^, z/) is unique. Thus the Levy triplet provides us with means for 
unique characterisation of a law of any Levy process. Bearing this in mind, 
the statistical inference for Levy processes can be reduced to inference on the 
characteristic triplet. There are several ways to approach estimation prob- 
lems for Levy processes: parametric, nonparametric and semiparametric ap- 
proaches. These approaches depend on whether one decides to parametrise 
the Levy measure (or its density, in case it exists) with a Euclidean param- 
eter, or to work in a nonparametric setting. A semiparametric approach to 
parametrisation of the Levy measure is also possible. Most of the existing 
literature dealing with estimation problems for Levy processes is concerned 
with parametric estimation of the Levy measure (or its density, in case it 



exists), see e.g. [l|, I4], where a fairly general setting is considered. There 
are relatively few papers that study nonparametric inference procedures for 
Levy processes, and the majority of them assume that high frequency data 
are available, i.e. either a Levy process is observed continuously over a time 
interval [0,T] with T ^ cxd, or it is observed at equidistant time points 



An, ■■■ , nAn and lim„^oo A„ = 0, lim„,^oo nAn = 00, see e.g. [J, l2l|, 123] . 
On the other hand it is equally interesting to study estimation problems for 
the case when the high frequency data are not available, i.e. when A„ = A 
is kept fixed. The latter case is more involved due to the fact that the 
information on the Levy measure is contained in jumps of the process X 
and impossibility to observe them directly as in the case of a continuous 
record of observations, or to 'disentangle' them from the Brownian motion 
as in the high frequency data setting, makes the estimation problem rather 
difficult. In the particular context of a compound Poisson process we men- 
tion [71, la, lla], where given a sample Yi, . . . , y„ from a compound Poisson 
process Y = {Yt)t>o, nonparametric estimators of the jump size distribu- 
tion function F (see [7|, |8|) and its density / (see ISj) were proposed and 



their asymptotics were studied as n — > 00. This problem is referred to as 
decompounding. Nonparametric estimation of the Levy measure u based 
on low frequency observations from a general Levy process X was studied 



in [25l . l35l |. However, these papers treat the case of estimation of the Levy 
measure only (or of the canonical function K in case of [351 ]) and not of 
its density. Moreover, the rates of convergence of the proposed estimators 
are studied under the strong moment condition E []Xi]^+ ] < 00, where <5 is 
some strictly positive number. This condition automatically excludes distri- 
butions with heavy tails. Nonparametric estimation of the Levy density of a 
pure jump Levy process (i.e. a Levy process without a drift and a Brownian 
component) was considered in (j^. We refer to those papers for additional 
details. 

In the present work we concentrate on nonparametric inference for Levy 
processes that are of finite jump activity and have absolutely continuous 
Levy measures. In essence this means that we consider a superposition of 
a compound Poisson process and an independent Brownian motion. The 
Levy-Khintchine formula in our case takes the form 

1 



(t>Xi (z) = exp 



i-fz - -a'z^ + / (e*^^ - l)p{x)dx 



(2) 



where the Levy density p is such that A := f^^p{x)dx < 00. To keep the 
notation compact, we again use 7 to denote the drift coefficient in ([2]), even 
though it is in general different from 7 in ([T]). Observe that the process 



X is related to Merton's jump-diffusion model of an asset price, see 24]. 



Additional details on exponential Levy models, of which Merton's model is 



a particular case, can be found e.g. in [l3[] 



Suppose that we dispose a sample Xa , -^2 A , • • • , XnA from the process 
X. By a rescahng argument, without loss of generality, we may take A = 1. 
Based on this sample, our goal is to infer the characteristic triplet (7, o"^, p), 
corresponding to ([2]). At this point we mention that a problem related 
to ours was studied in [5(. There an exponential of the process X (this 
exponential models evolution of an asset price over time) was considered 
and inference was drawn on parameters o", A and 7 and and the functional 
parameter, the Levy density p, based on observations on prices of vanilla 
options on this asset. The difference of our estimation problem with this 
problem is the observation scheme, since we observe directly the process X. 
Moreover, existence of an exponential moment of X was assumed in [5| (this 
is unavoidable in the financial setting, because otherwise one cannot price 
financial derivatives). 

Our estimators of 7, A and cj'^ will be based on ([2]) and the use of a plug- 
in device. To estimate p, we will use methods developed in nonparametric 
density estimation based on i.i.d. observations, in particular we will employ 
the Fourier inversion approach and kernel smoothing, see e.g. Sections 6.3 



and 10.1 in [3j] for an overview. In fact by the stationary independent 



increments property of a Levy process, see Definition 1.6 in 27|, the problem 
of estimating (7, o"^,/?) from a discrete time sample Xi,...,X„ from the 
process X is equivalent to the following one (to keep the notation compact, 
we again use X's to denote our observations): let Xi, . . . , X„ be i.i.d. copies 
of a random variable X with characteristic function given by ([2]) (in the 
sequel we will use X to denote a generic observation). Based on these 
observations, the problem is to construct estimators of 7, a"^ and p. We 
henceforth will concentrate on this equivalent problem. 

The rest of the paper is organised as follows: in Section [2] we construct 
consistent estimators of parameters a^,X and 7. In Section [3l using the 
estimators of (T^,A and 7, we propose a plug-in type estimator for p and 
study the behaviour of its mean integrated square error. In Section H] we 
derive a lower bound for estimation of p. All the proofs are collected in 
Section [5l 

2 Estimation of a^ A and 7 

In the sequel we will find it convenient to use the jump size density f{x) := 
p{x)/X. We first formulate conditions on p,a and 7, that will be used 
throughout the paper. 



Condition 2.1. Let the unknown density p belong to the class 

f-OO 

W{P, L,A,K) = <p : p{x) = Xf{x), f is a density, / x^f{x)dx < K, 

/oo 
\t\^\(t)f{t)\dt<L,\(^ (0,A][, 

where /?, L, A and K are strictly positive numbers. 

This condition implies in particular that the Fourier transform (i)p{t) = 
X(f)f{t) of p is integrable. The latter is natural in light of the fact that our 
estimation procedure for p will be based on Fourier inversion, see Section O 
The integrability of 4>p implies that p is bounded and continuous. It follows 
that / is bounded and continuous, and hence, being a probability density, 
it is also square integrable. Therefore p{x) = A/(x) is square integrable as 
well. This again is a natural assumption, because we will select the mean 
integrated square error as a performance criterion for our estimator of p. 
The condition A > ensures that the process X has a compound Poisson 
component. Restriction of the class of densities / to those densities that 
have the finite second moment is needed to ensure that E [X^] is bounded 
from above uniformly in p, 7 and a. The latter is a technical condition used 
in the proofs. 

Condition 2.2. Let a be such that a £ (0,S], where T, is a strictly positive 
number. 

This is not a restrictive assumption in many applications, since for in- 
stance in the financial context a, which models volatility, typically belongs 
to some bounded set, e.g. a compact [0, S] as in [^]- The condition a > 
in our case ensures that X has a Brownian component. If u = 0, then our 



problem in essence reduces to the one studied in J18l |. 



Condition 2.3. Let 7 be such that \'y\ < P, where P denotes a positive 
number. 

Remarks similar to those we made after Condition 12.21 apply in this case 
as well. 

Next we turn to the construction of estimators of u^, A and 7. The ideas 
we use resemble those in [5|]. Let ^{z) and 9(z) denote the real and the 
imaginary parts of a complex number z, respectively. From ([2]) we have 

log (|0x(t)|) = -A + A5R(0/(t)) - ^. (3) 

Here we used the fact that 



log 



Mf{t) 



logfe^«(</'/W)Ulog 



J^^i'Pfit)) 



mH(t))- 



Let v'^ be a kernel that depends on a bandwidth h and is such that 
v^{t)dt = 0, / 

-l/h J-l/h 

Observe that unhke kernels in kernel density estimation, see e.g. Definition 
1.3 in 31 1, the function v^ does not integrate to one and by calling it a 



/: 



fl/h / ^2\ 

v^{t)dt = 0, / (--]v''it)dt = l. 
J^i/h \ 2 y 



kernel we abuse the terminology. In view of ([3]) 

l/h i-l/h 

\og{\(t)x{t)\)v^\t)dt = A / ^{^f{t))v^{t)dt + a^. (4) 

-l/h J-l/h 

Provided enough assumptions on v , one can achieve that the right-hand 
side of dH) tends to ct^ as /i — > 0. A natural way to construct an estimator 
of (T^ then is to replace in Q log{\(pxit)\) by its estimator log ( |0emp (*)!)• 
Consequently, we propose 



rl/h 

al= / max{min{M„,log(|(/)e„ip(t)|)},-M„}t;'^(f)(it (5) 

J-llh 



as an estimator of a"^. Here M„ denotes a sequence of positive numbers 
diverging to infinity at a suitable rate. The truncation in (j5]) is introduced 
due to technical reasons in order to obtain a consistent estimator. 

We now state our assumptions on the kernel Vh-, the bandwidth h and 
the sequence M = (M„)„>i. 

Condition 2.4. Let the kernel v^{t) = h^v{ht), where the function v is 
continuous and real-valued, has a support on [—1, 1] and is such that 



v{t)dt = 0, / (- — ]v{t)dt = l, v{t) = 0{t'^) as t ^ Q. 



"1 /■! / +2 

1-1 
Here (3 is the same as in Condition \2.1\ 

Condition 2.5. Let the bandwidth h depend on n and he such that h^ = 
(77 log n)"-*^/^ with < rj < S^^. 

Using a default convention in kernel density estimation, we will suppress 
the index n when writing hn, since no ambiguity will arise. Condition 12.51 
implies that ne ' — > 00, since the logarithm of the left-hand side of this 
expression diverges to minus infinity. Condition 12.51 is required to establish 
consistency of estimators of (T^,A,7 and p. Hence it is of the asymptotic 
nature. For finite samples of moderate size, however, it might lead to un- 



satisfactory estimates. A separate simulation study in the spirit of [l7|) is 



needed to study possible bandwidth selection methods in practical problems. 



Condition 2.6. Let the truncating sequence M = (M„)„>i be such that 
Mn = mnh~'^, where rUn is a sequence of real numbers diverging to plus 
infinity at a slower rate than log n, for instance nin = log log n. 

Other restrictions on M are also possible. 

In the sequel we will frequently employ the symbol < and >, meaning 
'less or equal up to a universal constant', or 'greater or equal up to a universal 
constant', respectively. The following theorem establishes consistency of a^. 

Theorem 2.1. Let Conditions \2. 1\W^ be satisfied and let the estimator a^ 
be defined by ([5]). Then 

sup sup sup E[(a^-fj2)2] < (logn)-^-3. 

|7|<ro-e(0,S] p&W{l3,L,A,K) 

To construct an estimator of the jump intensity A, we will again use ^, 
but now in a different way. Let u denote a kernel that depends on h and 
is such that 

~l/h j-l/h 

u^{t)dt = -1, / t^u^{t)dt = 0. 

^l/h J-l/h 

Then 

l/h j-l/h 

\og{\(^x{t)\)u^{t)dt = A + A / ^{(pf{t))u''{t)dt. (6) 

'l/h J-l/h 

With a proper selection of u one can ensure that ([6]) converges to A as 

h ^ 0. Using a plug-in device, we therefore propose the following estimator 

of A: 

/■^/^ 
A„= / max{mm{Mn,log{\(pempit)\)},-Mn}u^{t)dt. 

J-l/h 

Now we state a condition on the kernel u . 

Condition 2.7. Let the kernel u (t) = hu{ht), where the function u is 
continuous and real-valued, has a support on [—1, 1] and is such that 

1 r-l 

u{t)dt = -1, / t^u{t)dt = 0, u{t) = 0{t^) ast^O. 
1 J-i 



Here j3 is the same as in Condition \2.1[ 

The following theorem deals with asymptotics of the estimator A^ 



Theorem 2.2. Let Conditions \2.1W2.'J\ and \2.5\^7!\ be satisfied and let the 
estimator A„ be defined by ([6]) . Then 

sup sup sup E[(A„-A)2] < (logn)-^-^ 

|7|<ro-G(0,S] p£W{l3,L,A,K) 



Finally, we consider estimation of the drift coefficient 7. By (J2]) we have 

Q{Log{4>x{t)))=7t + XQ{4>f{t)), 

where Log{(j)x{t)) denotes the distinguished logarithm of the characteristic 
function (j)x{t), i-e. a logarithm that is a single- valued and continuous func- 
tion of t, such that Log(^js:(0)) = 0, see Theorem 7.6.2 in pjj] for details of 
its construction. Let w'^ denote a kernel that depends on h and is such that 

l/h 

tw^{t)dt = 1. 
-i/h 

Then 

l/h t-l/h 

Q{Logicl)x{t)))w^{t)dt = 7 + A / 'i{(Pfit))w^{t)dt. 

-l/h J-l/h 

With an appropriate choice of w the right-hand side will converge to 7. 
Therefore, by a plug-in device, for those a;'s from the underlying sample 
space ri for which the distinguished logarithm can be defined, we define an 
estimator of 7 as 



/ ma.x{mm{Q{Log{(l)empit))),Mn},-Mn}w''{t)dt, (7) 

J-l/h 



while for those w's for which it cannot be defined, we assign an arbitrary 
value to the distinguished logarithm in ([7]), e.g. zero. The distinguished 
logarithm in ([7]) can be defined only for those w's for which (pempii), as a 
function of t does not vanish on [— /i~^, /i~^], see Theorem 7.6.2 in [11,]. In 
fact the probability of the exceptional set, where the distinguished logarithm 
is undefined, tends to zero as n — > 00. We will show this by finding a set Bn, 
such that on this set the distinguished logarithm might be undefined, while 
on its complement B'^ it is necessarily well-defined. We have 



te[-h-^,h 



-1\-cy'^l{2\?) y g-2A-S2/(2/).2) 



inf |</.x(i)| > e-2^--'/(2^') > e-2A-^V(2/^^). (8) 

h-1 h-l] 



Define 



Bn=< sup {(pempit) -(j)x{t)\> S\ , 

[te[-h-\h-^ J 

f 1 

B^ = < sup \(l)emp{t)-(l)x{t)\<S} , 

[tel-h-\h-^ J 

with 5 = (l/2)e-2^-^'/(2/i'). From ([HD, © and Theorem 7.6.2 of HH it 
follows that on the set B^ the distinguished logarithm is well-defined (with 



t restricted to [—h^^,h^^]), since on this set (pemp cannot take the value 
zero. Notice that given our conditions on p and a, our choice of 6 is suitable 
whatever p,a and 7 are. All we need to show is that P(i?n) -^ 0. The 
following theorem holds true. 

Theorem 2.3. Let W{(3,L,A,K) be defined as in Condition \2.1[ Then 

sup sup sup P(S„) < -2-. 

\-y\<rae{0,T,]p(^W{f3,L,A,K) "■" 

Notice that by Condition 12.51 we have P(i?n) -^ 0. We now state a con- 
dition on the kernel w . 

Condition 2.8. Let the kernel w (t) = h'^w{ht), where the function w is 
continuous and real-valued, has a support on [—1, 1] and is such that 

tw{t)dt = 1, w{t) = Oit^^) ast^O. 

-1 

Here (3 is the same as in Condition \2.1\ 

The following result holds. 

Theorem 2.4. Let Conditions {MMIMIKMKE and\KE be satisfied and let 
the estimator 7„ be defined by ^ . Then 

sup sup sup E[(7„ -7)^] < (logn)"^"^, 

|7|<r ae{0,S] p£Wsy,^{l3,L,A,K) 

where Wsym{(3,L,A, K) denotes the class of symmetric Levy densities that 
belong to W{f3,L,A,K). 

The reason why we restrict ourselves to the class of symmetric Levy 
densities is that we would like to obtain a uniformly consistent estimator of 
7 (and eventually of p, see Section [3|). The main technical difficulty in this 
respect is the (uniform) control of the argument (i.e. of the imaginary part) 
of the distinguished logarithm in ([7]), see the proofs of Theorems l2.4l and l2.51 
For transparency purposes we restrict ourselves to the class of symmetric 
p's. If we are only interested in the consistency of the estimator for a fixed 
p, then the above restriction is not needed and the result holds without it. 
We formulate the corresponding theorem below. 

Theorem 2.5. Let Conditions \2.5lW^ and [278\ be satisfied. Furthermore, 
let 7 G M, o"^ > and let p be such that 

/OO /"OO 

x^f{x)dx < 00; / \t\'^\(t)f{t)\dt < 00. (10) 

-00 J —00 

Let the estimator 7„ be defined by ([7]) . Then 

E[(7n-7)']<(logn)-^-^ 

Now that we obtained uniformly consistent estimators of cr^, A and 7, we 
can move to the construction of an estimator of p. 



3 Estimation of p 

The method that will be used to construct an estimator of p is based on 
Fourier inversion and is similar to the approach in [18t |. Solving for (f)p in 
(ED, we get 



Mi) = Log ( „.,.::;,:.,./, ) . (n) 



Here Log again denotes the distinguished logarithm, which can be con- 



structed as in Theorem 7.6.2 of [111] taking into account an obvious difference 
that in our case the function e^f^^' equals e'^ at t = 0. 
By Fourier inversion we have 

This expression will be used as the basis for construction of an estimator of p. 
Let khe a symmetric kernel with Fourier transform (pk supported on [—1, 1] 
and nonzero there, and let /i > be a bandwidth. Since the characteristic 
function (jjx is integrable, there exists a density q of X, and moreover, it is 
continuous and bounded. This density can be estimated by a kernel density 
estimator 






see e.g. 3l|, |3j] for an introduction to kernel density estimation. Its charac- 
teristic function 4'empit)(pkiht) will then serve as an estimator of (j)x{t)- For 
those w's from the sample space fi, for which the distinguished logarithm 
in the integral below is well-defined, p can be estimated by the plug-in type 
estimator, 

1 f^''' .-itxj^J <Pempit)Mht) 



p„(x) = — / e-'*-' Log ^--^P^y^-y- dt, (12) 

while for those w's, for which the distinguished logarithm cannot be defined, 
we can assign an arbitrary value to Pnix), e.g. zero. Notice that the estimator 
(|12p is real-valued, which can be seen by changing the integration variable 
from t into —t. 

Our definition of the estimator is quite intuitive, however in order to 
investigate its asymptotic behaviour, some modifications are due: we need 



10 



to introduce truncation in the definition of p„ and consequently, we propose 
27r 7-i/h 27r J_i/h 2 27r J_i/h 



1 Z"-^'" 
+ / e-''''ma.x{mm{Mn,log{\<Pemp{t)(t>k{ht)\)},-Mn}dt 

1 f^^^ _• 
+ i— e **''max{min{M„,arg(0emp(i)'Afe(/iO)}>-^n}di 

2vr J_i//i 



(13) 



as an estimator of p{x). Here M = (M„)„>i denotes a sequence of positive 
numbers satisfying Condition 12.61 while log and arg are the real and imagi- 
nary parts of the distinguished logarithm, respectively. Notice that in (I13p 
we essentially truncate the real and imaginary parts of the distinguished 
logarithm from above and from below. The truncation is only necessary 
to make asymptotic arguments work and in practice we do not need to 
employ it. Observe that |/5„(x)p is integrable, since by Parseval's identity 
each summand in (llSp is square integrable. Furthermore, by Theorem 12.31 
the probability of the set, where the distinguished logarithm in (|13|) can be 
defined, tends to one as the sample size n tends to infinity. 

We now state a condition on the kernel k that will be used when studying 
asymptotics of /3„. 



vrx 



Condition 3.1. Let the kernel k he the sine kernel, k{x) = sinx/( 

The Fourier transform of the sine kernel is given by ipkit) = l[-i,i](0- 
The use of the sine kernel in our problem is equivalent to the use of the 
spectral cut-off method in [5| in a problem similar to ours. The sine kernel 
has been used successfully in kernel density estimation since a long time, 



see e.g. [15|, ll6|. An attractive feature of the sine kernel in ordinary kernel 
density estimation is that it is asymptotically optimal when one selects the 
mean square error or the mean integrated square error as the criterion of 
the performance of an estimator. Notice that the sine kernel is not Lebesgue 
integrable, but its square is. 

Now we will study the asymptotics of p„ . As a criterion of performance 
of the estimator pn we select the mean integrated square error 



MISE[p„] = E 



\f)n{x) - p{x)\'^dx 



Other possible choices include, for instance, the mean square error and the 
mean integrated error of the estimator. These are not discussed here. The 
theorem given below constitutes the main result of the paper. It provides an 
order bound on MISE[p„] over an appropriate class of characteristic triplets 
and demonstrates that the estimator pn is consistent in the MISE sense. 



11 



Theorem 3.1. Assume that assumptions of Theorem,s \2.1\\2^\ hold. Let the 
estmator pn he defined by ()13p . Then 

sup sup sup MISE[p„] < (logn)~^, 

where W*y^{(3,L,C,A, K) denotes the class of Levy densities p, such that 
p G Wsym{f3,L,A,K) and additionally 

|tp^|(/./(t)|^(it < C. 

The remark that we made after Theorem 12.41 apphes in this case as 
well: if we are willing to abandon the uniform convergence requirement, the 
similar upper bound as in Theorem 13.11 can be established for a fixed target 
density p without an assumption that it is necessarily symmetric. We state 
the corresponding theorem below. 



Theorem 3.2. Assume that Conditions \2.4\^^^ hold. Let A > 0, o" > and 



let p satisfy ([TO]) . In addition, suppose that 

\t\'^'^\(j)f{t)\^dt < oo. 



Let the estimator pn be defined by (jlSp . Then 

MISE[p„]<(logn)-'3. 

4 Lower bound for estimation of p 

In the previous section we showed that under certain smoothness assump- 
tions on the class of target densities p, the convergence rate of our estimator 
Pn is logarithmic. This convergence rate can be easily understood on an 
intuitive level when comparing our problem to a deconvolution problem, see 
e.g. Section 10.1 of [3^] for an introduction to deconvolution problems. A 
deconvolution problem consists of estimation of a density (or a distribution 
function) of a directly unobservable random variable Y based on i.i.d. copies 
Xi, . . . , Xn of a random variable X = Y + Z. The X's can be thought of as 
repetitive measurements of Y, which are corrupted by an additive measure- 
ment error Z. It is well-known that if the distribution of Z is normal, and 
if the class of the target densities is sufficiently large, e.g. some Holder class 



(see Definition 1.2 in [3l|]), the minimax convergence rate will be logarith- 
mic for both the mean squared error and mean integrated squared error as 
measures of risk, see [19|, |20| . We will prove a similar result for a problem 
of estimation of a Levy density p. 



12 



Theorem 4.1. Denote by T an arbitrary Levy triplet (7, cr^,/o), such that 
I7I < r,cr G (0,S],A G (0,A]. Furthermore, let 

|tp^|0/(t)|2dt < C (14) 

for (5 > 1/2. Let T be a collection of all such triplets. Then 

infsupMISE[p„]>(logn)-^, 
pn r 

where the infimum is taken over all estimators pn based on observations 
^1, ■ ■ ■ iXn. 

Using similar techniques, it is expected that lower bounds of the loga- 
rithmic order can be obtained for estimation of 7, a'^ and A as well. Such a 
result is not surprising e.g. for a^, if one recalls comparable results from [9| 
for estimation of the error variance in the supersmooth deconvolution prob- 
lem. Another paper containing examples of the breakdown of the usual root 
n convergence rate for estimation of a finite-dimensional parameter is [22]. 
We do not pursue this question any further. We also notice that the log- 
arithmic lower bounds for estimation of the components of a characteristic 
triplet (under a different observation scheme) were obtained in [5|. 

Our estimation procedure for p in Section [3] relies on the assumption that 
the random variable X has a density (the latter is ensured by the condition 
(7 > 0). If o" = 0, then an approach of [18] may be used for estimation of 
p. For completeness purposes, however, we will show that the lower bound 
for the minimax risk in this case is not logarithmic as in Theorem 14. H but 
polynomial. 

Theorem 4.2. Let T denote a collection of Levy triplets T = (7,0,/?), such 
that I7I < r and A G (0, A]. Furthermore, let (pf satisfy ^^ for (3 > 1/2. 
Then 

inf supMlSE[p„] > n-2/^/ (2/3+1)^ 

pn r 

where the infimum is taken over all estimators pn based on observations 
Xi, . . . ,Xn- 

This theorem in essence says that estimation of the Levy density p in the 
case a = seems to be as difficult as e.g. nonparametric density estimation 
based on i.i.d. observations coming from the target density itself, see e.g. |32l |. 
This result has a parallel in [51]. In absence of the corresponding upper bound 
for estimation of p nothing can be said about how sharp the lower bound 
in Theorem 14.21 is, but in any case the polynomial minimax convergence 
rate seems to be natural. An upper bound of order Ti-f'e'^/l^p+i) j-^g^g i^ggn 



obtained in the compound Poisson model in 1^ for the mean integrated 
squared error when estimating xp{x) under the condition that the class of 
Levy densities is a Sobolev class E(/9, C). 
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5 Proofs 

We first prove the following technical lemma. 

Lemma 5.1. Let the sets Bn and B^ he defined by Q. Suppose Conditions 
\2.5\ and \ 2.6] hold. Then there exists an integer uq, such that on the set B^ 
for all n > no we have 



max{min{M„,log(|(/)emp(t)|)}, -M„} = log(|0emp(t)|) 



(15) 



for t restricted to the interval [—h ^,h ^] and for all p £ W{I3, L, A, K), a G 
(0, S] and \^\ < T. Furthermore, 



max{min{M„,arg((^emp(t))}, -M„} = aTg{(pempit)) 



(16) 



fort restricted to the interval [—h~^, h~^] and for all p E Wsym{f3,L,A, K),a G 
(0, E] and \^\ < T. Here arg denotes the imaginary part of the distinguished 
logarithm of (j)empit), i.e. a continuous version of its argument, such that 
argcpempiO) = 0. 



Proof. Formula (jlSp can be seen as follows: 



log(|(/.e„.p(t))||<|log(|(/>x(t)|)| + 



<|log(|(/>x(t)|)| + 



log 



■'emp 



(t) 



H{t) 



■'emp 



it) 



(t^xit) 



+ 



y^empv^) 



<Px{t) 



(17) 



<|log(|(/>x(t)|)| + - 

Here in the third line we used an elementary inequality | log(l + 2;) — zj < \z\ 
valid for \z\ < 1/2 and the fact that on the set B^ we have 



(Pemp\'^) 



<t>x{t) 



1 



< 



Yempy^) 



OX 



(t) 



< 



1 



(18) 



while in the last line we used the bound | log |0x(i)|| < 2A + T?/{2h'^). The 
equality ()15p now is immediate from Conditions 12.51 and 12.61 because the 
upper bound for | log(|i;^emp(i)|)| grows slower than M„. Next we prove (fTHl) . 
The symmetry of p implies that (/>p is real- valued and hence arg((/>x(i)) = 0- 
On the set B^ we have | aig{(t)emp{t))\ < 2tt, because the path (pemp{t) cannot 
make a turn around zero on this set. This proves (fT6]) . since M„ diverges to 
infinity. D 

Now we are ready to prove Theorems l2.1H3.1i 
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Proof of Theorem \2.1\ Write 

E [{i^l - a^f] = E [{dl - a^flB^ + E [{al - a^flB^^ = 1 + 11, 
where the set Bn is defined as in ([9]). For / we have 

2 



I < I m: 



< M' 



U'^'' W'{t)\dt\ +sMp(s„) 



1/h 
-1/h 



\v''it)\dt] +S^ 



,S2//l2 



n 



/l2 



Mnh^(f \v{t)\dt] +S^ 



^S^'/h^ 



nh^ 



where we used Theorem 12.31 to see the second hne. Observe that under Con- 
ditions [23] and [2]6] the last term in the above chain of inequahties converges 
to zero faster than h?l^~^^ . Now we turn to //. On the set S^, for n large 
enough, truncation in the definition of a^ becomes unimportant, see Lemma 
15.11 and we have 



/I = E 



E 



i/h 

-1/h 
l/h 
■1/h 



\og{\(l)emp{t)\)v^{t)dt - a 
empXj') 



1 



B"^ 



log 



(l)x{t) 



1/h 
1/h 



v''{t)dt+ log{\cl)x{mv''{t)dt-a^] Ibc 



Using this fact, ^ and an elementary inequality (o + 6)^ < 2(0^ + 6^), we 
obtain that 



n<A^ f'^\{4>f{t))v\t)dt 
\J-i/h . 



+ E 



1/h 



log 



i-i/h 
III + IV. 



H^empV') 



OX 



it) 



^v^\t)dt] lac 
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For III we have 






l/h 



^^/mm^it 



\ti'm{4,!mdt 



oo 






where in the second hne we used Condition 12.41 to obtain the third hne 
we used the fact that \^{(j)f{t))\ < 10/(^)1 + |(/>/(— 1)|, while the fourth hne 
fohows from Condition 12.11 We turn to IV. We have 



IV <E 


/ rl/h 
\J-l/h 


(PempV') -1 

4>x{t) 


v' 


\ 

\t)dt 
J 


2 




+ E 


it'"' 

\J-l/h 


H 


Yemp(t) 

4>x{t) 


\ f 

J V 


(Pempv'j 
Cpxit) 



1 } v\t)dt Ibc 



= V + VI. 

Some further bounding and an apphcation of the Cauchy-Schwarz inequahty 
give 



y < g4A+S2//>2 



l/h 
l/h 



{v^{t)fdtE 



.l/h 

/ \<Pemp{t) - <Px{t)\' 

J-l/h 



dt 



Parseval's identity and Proposition 1.7 of 3l| appHed to the sine kernel then 
yield 



E 



l/h 



'l/h 

whence 



^emp{t)-<t>x{t)\^dt 



2ttE 



l/h 



-l/h 



iqn{x) -E[qnix)]fdx 



nn 



n 



As far as VI is concerned, using (fT8|) . an elementary inequality | log(l + z) — 
z\ < l-^l^, valid for \z\ < 1/2, and the Cauchy-Schwarz inequality, we obtain 



16 



that 



VIS. 



i/h 



l/h 



{v^{t)fdtE 



i/h 



YempV^) 



1 
< - 

- 4 



l/h 



l/h 



{v^{t)fdtE 



l/h 
l/h 

-l/h 



'x(t) 

Yemp\J) 



< gSV/.^ 



l/h 
-l/h 



iv^{t)fdtE 



(t>x{t) 

l/h 



1 



dtlB: 

2 

dt 



l/h 



\4>empit) - 4>xit)\ dt 



Hence VI can be analysed in the same way as V. From the above bounds 
on V and VI it also follows that IV is negligible in comparison to ///. 
Combination of all these intermediate results completes the proof of the 
theorem. D 

Proof of Theorem \2.2l The proof is quite similar to that of Theorem 12.11 
Write 

E [(A„ - A)2] = E [(A„ - XflBj + E [(A„ - X)Hbc] =I + II. 

By an argument similar to that in the proof of Theorem 12.11 



I<{Mli^JHt)\dt^ +A2 



^T^/h'^ 



nh? 

This is negligible compared to /i^^+^. Now we turn to //. We have 

2 " 



17 = E 



<A 



+ E 




\og{\(t)emp{t)\)u^{t)dt-X\ 1b,c 
^empKj'j 



{log 



^X 



{t) 



+ \og{\<Px{t)\)}u%t)dt-\\ 1 



B" 



^{4>f{t))u''{t)dt 
emp\'^) 



log 



5X 



(t) 



u^{t)dt 1 



J3<^ 



Here in the third line we used Q. Similar as we did it for III in the proof 
of Theorem 12. H one can check that in this case as well III < h?!^^"^ . As far 
as IV is concerned, it is of order e ' n~^, which can be seen by exactly 
the same reasoning as in the proof of Theorem 12.11 Combination of these 
results completes the proof of the theorem, because under Condition 12.51 the 
dominating term is III. D 
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Proof of Theorem \2.3[ By Chebyshev's inequality 



P(i?n) < ^E 



sup \(t}emp{t) - (l)x{t)\ 



Thus we need to bound the expectation on the right-hand side. This will be 
done via reasoning similar to that on pp. 326-327 in [9|. For all unexplained 
terminology and notation used in the sequel we refer to Chapter 2 of [33] • 
Notice that 



E 



sup \(t)emp{t) - (t)x{t)\ 

ie[-h-\h-^] 



1. 



-E 



n 



sup \GnVt\ 



Here GnVt denotes an empirical process defined by 

1 " 



where the function vt : x ^-^ e . Introduce the functions f ^ : x i— > cos(tx 
and v^:xh^ sin(ix). Then 



E 



sup \GnVt\ 

.tG[-/i-l,fc-l] 





"/ 




\2" 


<E 


sup 


\Gnvl\ 






\tel-h-^,h-^ 




/ 




' / 


\ 


2" 


+ E 


sup 


\GnV^\ 






\te[~h-^,h-^] 


/ 





As it will turn out below, both terms on the right-hand side can be treated 
in the same manner. Observe that the mean value theorem implies 



\v\(x) -v\{x)\ < \x\\t- s\ 



(19) 



for i = 1,2, i.e. v\ is Lipshitz in t. Theorem 2.7.11 of [33l | applies and gives 
that the bracketing number A^n of the class of functions F„ (this refers either 
to v\ or v1 for |t| < /i~^) is bounded by the covering number N of the interval 
/„ = [-/i-i,/i-i],i.e. 



iV[](2e||x||L,(Q);F„;L2(Q))<iV(e;/„;|-| 
Here Q is any discrete probability measure, such that ||x| 



L2(Q) 



> 0. Since 



iV(e ||x||i^^(Q) ;F,;L2(g)) < iV[](2e ||x||l,(q) ;F,;L2(Q)), 
see p. 84 in [33y, and trivially 

iV(e;/„;|-|)<^^, 
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we obtain that 
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N{e\\x\\j^^^Qy,¥n-M{Q))<-^^. (20) 

Define J(1,F„), the entropy of the class F„, as 

J(l, F„,) = sup y {1 + log(A^(e ||x||i^^(Q) ; ¥nM{Q)))V^^de, 

where the supremum is taken over all discrete probability measures Q, such 
that ||x||l (q) > 0. Since F„ is a measurable class of functions with a mea- 
surable envelope (the latter follows from (fT9|) ). by Theorem 2.14.1 in [33] we 
obtain that 



E 



sup 



in 11*1 



< 



|2^(P)(J(l,F„))^ 



where the probability P refers to F^^^2p . Now notice that 



\x\ 



L2(P) 

,Af(A) 



E[{-f + Y + aZf]<-f^+E[Y^]+a^, 



where Y := ^j^i Wj denotes the Poisson sum of i.i.d. random variables 
Wj with density /, while Z is & standard normal variable. Under conditions 
of the theorem the term 






xf{x)dx I + A / X f{x)dx 



9 

is bounded uniformly in p. Hence \\x\\i^ (p^ is also bounded uniformly in p,a 
and 7. Using (pOj) . the entropy can be further bounded as 

1/2 



ja,F„)<^'{i + iog(Hi)} *. 



Here we implicitly assume that n is large enough, so that we take a square 
root of a positive number. Working out the integral, it is not difficult to check 
that J(1,F„) = 0{h^^). Combination of these results yields the statement 
of the theorem. D 



Proof of Theorem \2.4\ Again, the proof is quite similar to that of Theorem 
[211 Write 

E [{in - if] = E [{in - 7)'1bJ + E [{in " 1?Ib{\ =1 + 11. 

For / we have 

I < Ul^h' 1^1'^ \w{t)\d?j + tA P(i?0. 



19 



Thanks to Theorem 12.31 the right-hand side converges to zero as n — > oo. 
Moreover, it is neghgible compared to /i^^"*"^. Next we turn to II. By Lemma 
15.11 on the set B^ for n large enough truncation in the definition of 7„ 
becomes unimportant and we have 



11 




<A2e 



+ E 



The same reasoning as in Theorem 12.11 shows that here as well /// is of 
order /i^^^^. As far as IV is concerned, the inequality |9(^;)| < 1^1 implies 
that 



/T/<E 




Log 



(t>x{t) 



w^{t)dt 1 



B" ■ 



Now notice that on the set B^ the inequality 



holds, cf. formula (4.8) in [13] ■ Therefore 
IV <E 

+ E 



< 



-'emp 



(t) 



Hit) 



(21) 




w^{t)dt 



w^{t)dt 




Just as for IV in the proof of Theorem 12. 1^ one can check that in this case 
as well IV is negligible in comparison to III. Combination of these results 
completes the proof of the theorem. D 

Proof of Theorem \2.5[ The proof follows essentially the same steps as the 
proof of Theorem 12. 4[ The only significant difference is that we have to 
verify that there exists an integer no, such that on the set 5^^ for all n > tlq 
truncation in the definition of 7„ is unimportant for an arbitrary p satisfying 
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conditions of the theorem, and not necessarily for a symmetric p as in Lemma 
15.11 To see this, first notice that 



9(Log(0emp(t))) = 
9(Log((/>x(t))) = 

Let -(/) : R — > C, where 



9(Log(e^e'^'*'/2, 



0emp(i))) 



9(Log(e'^e'^'/^0x(i))) = 9(e>/(t)). 



(px{t)e^e 



A *2/2 



=A</./(t) 



By the Riemann-Lebesgue theorem ^(t) converges to 1 as |f | 
there exists i* > 0, such that 



cxD and hence 



m) - 1| < 



\t\ > t* 



(22) 
(23) 



Furthermore, we have 

|V'(OI > e~^, t GM. 

Since / has a finite second moment, by Theorem 1 on p. 182 of ^2^] the 
characteristic function (j)f is continuously differentiable. Consequently, so 
is the exponent ^/J. Therefore the path ■0 : [— t*,t*] ^ C is rectifiable, i.e. 
has a finite length. In view of this fact and (I23p . ■0 : [— t*,t*] — > C cannot 
spiral infinitely many times around zero (because otherwise it would have 
an infinite length) and for |t| > t* it cannot make a turn around zero at 
all because of (j22p . Since M„ diverges to infinity, it follows that for every 
UJ £ B,^^ there exists no(u;), such that h~^ > t* and for all n > no(i 



UJ 



max{min{M„, 9(Log((/)emp (*)))}, -M„} = 9(Log(0emp(t))). (24) 

However, it is easy to see that in fact there exist a universal integer no, such 
that (f2l]) holds for all oj G B'^h- J^^* notice that for each uj the number 
of turns that (f)empit) makes around zero is determined by the number of 
turns m that '(/'(t) makes around zero and cannot be greater than 2m, say. 
Consequently, there exists a universal bound 4m7r on Q{hog{(j)empit))) valid 
for all UJ £ -B^^ . This concludes the proof of the theorem. D 



Proof of Theorem lA'. J[ We have 



E 



\Pn{x) - p{x)\'^dx 



E 



\Pn[X) 



— oo 
oo 



where Bn and B^ are defined by 

\pn{x)- p{x)\'^dx< 



+ E 

= 1 + 11, 

3). Notice that 

\pnix)\'^dx + 



- p{x)\^dxlB„ 
p{x)\ dxl 



Lgc 



\p{x)\'^dx. 
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By Parseval's identity and Condition 12. II 

\p{x)\^dx<l. 

'J 

For the Fourier transform of pn we have 

\^Pn{t)\<Mnll-h-\h-^{t)- 



Hence by Parseval's identity 



\pn{xtdx < M^-. 



Using this and Theorem 12.31 we get that 

/ < <! Afn^ + 1 



1 . 1 e^V/^^ 



h 



n. 



/l2 ■ 



Under Conditions 12.51 and 12.61 the latter is negligible in comparison to h'^^. 
Now we turn to //. By Parseval's identity 



//= — E 

2tt 



hnit)-Mt)\^dtiB^„ 



1 

2^' 



E 



oo 
1/h 

-1/h 



hnit) - Mi)\^dtlBc 



+ 



27r 



-/i-i,/i-i) 



M'dtPiB^) 



= III + IV. 
For IV we have 



IV < [ \(t)p{t)\'^dt = X'^ [ 1^2/3 

jR\(-/i-i,h-i) JR\(-fe-i,fe-i) 



\Mt)\' 

1*2/^1 



■dt 



where the last inequality follows from the definition of the class W*^{(5, L, C, A, K). 
Next we turn to ///. With (fTSl) and (fT6l) we have that 





1 


ri/h 


III = 


E 






27r 


J-i/h 



6p„(i)-</'pWl'dilBS 
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for all n large enough. Consequently, 



/// < E 



+ E 



[cr„ 



a 



2\2 



l/h 



t'^dtl 



B'^ 



-l/h 



l/h 



+ E 



+ E 



l/h 
(An - \? 



Log((/.e™p(t))-Log(0x(t))| 1 



_B<= 



l/h 

-l/h 
l/h 

-l/h 



t^dtlB'^ 



dtl 



B'^ 



= IV + V + VI + VII. 
For IV we have by Theorem 12. II that 



'""^h^ 



[ol 



-'f^B, 



0{h^^+'] 



As far as V is concerned, by the inequality (|21|) 



V<E 



l/h 
l/h 



y^emp\J) 



(t)x{t) 



1 



2 




" r^/h 


dtlsc 


+ E 


/ 






J-i/h 



YempyT^) 

(Px{t) 



1 



dtl 



B" 



The right-hand side can be analysed similar to V in the proof of Theo- 
rem [27T] and in fact it is negligible in comparison to h?^ . Furthermore, by 
Theorem 12.41 VI is of order /i^^"*"^. Also VII is of order /i^^"*"^ by Theorem 
12.21 Combination of all the intermediate results completes the proof of the 
theorem. D 

Proof of Theorem \3.2l The proof uses the same type of arguments as that 
of Theorem 13.11 The only essential difference is to show that there exists 
no, such that on the set B^j^ for all n > no we have /0„(a;) = Pn{x). We 
therefore consider in detail only this part of the proof. For arg((/)emp(i)) 
the corresponding argument was already given in the proof of Theorem 12.51 
Thus we only have to prove that 

max{min{M„,log(|(/)emp(t))}, -M„}1bc^ = log(|(/)emp(t)|)lB^^- 

The latter can be shown by exactly the same arguments that were used in 
the proof of (jlSp in Lemma 15.11 D 



Proof of Theorem \4-l\ The proof makes use of some of the ideas found in 
[lO, ll9|. Consider two Levy triplets Ti = (0, a^,pi) and T2 = (0, (T^,p2), 
where Pi{x) = Xfi{x),i = 1,2 and A < A. Let 

fiix) = 7;{n{x) +r2(x)). 
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where the probability densities ri and r2 are defined via their characteristic 
functions, 

With a proper selection of /3i, /32, oi and a2 one can achieve that /i satisfies 
()14p with a constant C/4 (instead of C). We also assume that 1 < 02 < 2. 
Notice that ri is a bilateral gamma density, while r2 is a stable density. To 
define /2, we perturb /i as follows: 

where 5^ — *■ as n — > 00, and the function H satisfies the following condi- 
tions: 

1. fz\tn^H{trdt<c/4; 

2. f^^H{x)dx = 0; 

3. j'^^Hix)dx^O; 

4. (PH{t) = for t outside [1,2]; 

5. (pnit) is twice continuously differentiable. 

To see why such a function exists, see e.g. p. 1268 in [19]. It is also obvious, 
that there are many functions H with an appropriate tail behaviour, such 
that f2{x) > for all x S R, at least for small enough (5„. With such an H 
and small enough 5n, the function /2 will be a probability density satisfying 
(dH). Notice that 

roo 

{P2{x)-pi{x)fdx^5f. (25) 



Here the symbol x means 'asymptotically of the same order'. Denote by qi 
a density of a random variable X corresponding to a triplet Tj, i = 1, 2. The 
statement of the theorem will follow from (I25p and Lemma 8 of 10(] , if we 
prove that the x^-divergence (see p. 72 in [31] for a definition) between §2 
and qi satisfies 

2t . r iQ2{x) - qiix))\ 

nx iQ2,qi) = n / -^ dx < c, (26) 

J -00 Qi{x) 

where a positive constant c < 1 is independent of n. 

Let gi be a density of a Poisson sum Y conditional on the fact that 
its number of summands A^(A) > 0. Here the index i refers to a triplet 
Ti,i = 1, 2. Since 

ct>Y{t) = e-^ + (1 - e-^)^-i- fe^<^AW - l) , (27) 



e^ 
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it follows that 
We also have 

oo 

9iix) = 5^ /r(x)P(iV(A) = n\N{\) > 0). (28) 

n=l 

From (|27p we obtain 

qi{x) > (1 - e"^)(?:>o,<x2 * 51(3;), 

where (J)q 0-2 denotes a normal density with mean zero and variance a'^ . More- 
over, by Lemma 2 of ^], there exists a large enough constant A, such that 
the right-hand side of the above display is not less than (1 — e~'*')g'i(|x| +A). 
Hence 



9i(i"^i+-4) ~ ;_„ fi(u\ + A} 



where the last inequality follows from (|28p . Splitting the integration region 
into two parts, we then get that 

nX^{Q2, qi)<n I {q2{x) - qi{x)fdx + n x'^{q2{x) - qi{x)fdx 

J\x\<A J\x\>A 

= 1 + 11. 

Here we used the fact that /i(x) behaves as |2;|~-^~"2 ^^ pi^g ^nd minus 



infinity, see e.g. formula (14.37) in [27|, and that 1 < 02 < 2. Since 



^^-1/2 / e^*-^(2./5jrf^ = 6^+'/^cl>H{6nt), 



by Parseval's identity it holds that 

1 r°° 



2x2 



n^—— '- / |0g,(t) - ^g,{t)Ye-'' * dt 



(1 — e"'*'"'^ ''°^ 

J —00 
-A\2 1 /-oo 

n^. i^— / |e^<^/2W - e^<^/iWpe-- * di 



' (e^ - 1)2 2^ 

/oo 



00 



where the last inequality follows from the mean-value theorem applied to 
the function e^ and the fact that |A0j. (t)| < A. By definition of /i and /2 
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we then get that 



/oo 
\^HiSnt)\'e-'^"'"dt 
-oo 
/oo 
-oo 



The choice 5n >^ (log n) ^'"^ with smah enough constant wih now imply that 
/ ^ as n — > oo. Next we turn to II. By Parseval's identity 



II < n- 



1 

'2^ 



\icPg,it)-<Pg,it))rdt. 



Here we use the fact that even though (pj-^ and (pf^ are not twice differentiable 
at zero, the difference 4>q2{t) — (pqiit) still is, because (pn is identically zero 
outside the interval [1, 2]. By exactly the same type of arguments as we used 
for I, one can show that II ^ as n ^ oo, provided (5„ x (logn)""*^'^. 
Hence (1261) is satisfied and the statement of the theorem follows. D 



Proof of Theorem \4-^ The proof is similar to the proof of Theorem 14.11 
Let pi{x) = Xfi{x) with /i as in the proof of Theorem 14.11 Consider a 
perturbation of pi, say P2{x) = A/2(x), where /2 is defined as in Theorem 4.1. 
Assume that the function H in the definition of /2 has a compact support on 
[—1,1] and that it satisfies Conditions 1-3 in the proof of Theorem l4.1[ This 
implies that f2ix) > for 5n small enough. Therefore p2 is a Levy density 
satisfying ^^, provided (5„ is small enough. Denote by Pi„ and P2n the 
laws of a Levy process X = {X)t>o restricted to the time interval [0, n] and 
corresponding to the characteristic triplets Ti = (0, 0, pi) and T2 = (0, 0, ^2), 
respectively. Notice that 



inf sup E 

Pn T 



(Pnix) - p{x)fdx 



> inf sup E 

Pn 7- 



(Pnix) - p{x)fdx 



(29) 

where pn denotes an arbitrary estimator based on a continuous record of 
observations oi X over [0, n]. Let K{P,Q) denote the Kullback-Leibler di- 
vergence between the probability measures P and Q, 



K{P,Q) 




ifP<.Q, 

if otherwise, 



see Definition 2.5 in [31]. In view of (I25p . the result will follow from formula 
(|29|) above, the arguments of Section 2.2 of 31[ combined with Theorem 
2.2 (iii) of [31(, provided the Kullback-Leibler divergence K{F2n,^in) be- 
tween the measures P2n and Pi„ remains bounded for all n by a constant 
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independent of n. The Kullback-Leibler divergence between P2n and P 



In 

can be easily computed via Theorem A.l of ij], which in our case gives 
that K{F2n,^in) = 'nK{p2, pi), because both pi and p2 have the same total 
mass. Let x'^{P2,Pi) denote the x^-divergence between the densities p2 and 
pi. It is not difficult to see that K{p2,pi) < X^{P2,Pi), cf. formula (2.20) 
in 311]. It follows that in order to prove the theorem, it suffices to show 
that X^(P2)Pi) = 0{n~^). By definition of pi,p2,H and a change of the 
integration variable we have that 

x\p2,Pi)<Sf^' f'^-^P^du. (30) 

The dominated convergence theorem implies that the right-hand side of the 
above equation is of order 6n ■ Taking 6n x n~^'^'^^^^' gives that (j30p is 
of order n~^. This yields the statement of the theorem. D 
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